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Abstract 

Source coding theorems and Shannon rate-distortion functions were studied for the 
discrete-time Wiener process by Berger and generalized to nonstationary Gaussian autore- 
gressive processes by Gray and by Hashimoto and Arimoto. Hashimoto and Arimoto pro- 
vided an example apparently contradicting the methods used in Gray, implied that Gray's 
rate-distortion evaluation was not correct in the nonstationary case, and derived a new 
formula that agreed with previous results for the stationary case and held in the nonsta- 
tionary case. In this correspondence it is shown that the rate-distortion formulas of Gray 
and Hashimoto and Arimoto are in fact consistent and that the example of of Hashimoto 
and Arimoto does not form a counter example to the methods or results of the earlier pa- 
per. Their results do provide an alternative, but equivalent, formula for the rate-distortion 
function in the nonstationary case and they provide a concrete example that the classic 
Kolmogorov formula differs from the autoregressive formula when the autoregressive source 
is not stationary. Some observations are offered on the different versions of the Tocplitz 
asymptotic eigenvalue distribution theorem used in the two papers to emphasize how a 
slight modification of the classic theorem avoids the problems with certain singularities. 



1 Introduction 

A Gaussian autoregressive source is defined by the difference equation 

v _ \ -J2k=i a kX n -k + Z n n = l,2, ■■■ 

Xn -\0 n<0 (1) 

where the Z n are iid random variables with mean zero and variance a 2 and where we require 
that 

oo 

'Y] |o fc | < oo (2) 
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for ao = 1. If the roots of A(z) = Sfc^o ^ ^ au ^ e strictly inside of unit circle, then the 
statistics of the process approach a stationary distribution and the Shannon rate-distortion 
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function of the process is given parametrically by Kolmogorov's classic formula [3] (see also [5] ) 
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Berger [T] proved a source coding theorem for the special case of a nonstationary autore- 
gressive process with a\ = 1 and = for > 1 and he showed that the Kolmogorov formula 
still provided the rate-distortion function in this case. Gray [2] subsequently proved a source 
coding theorem for the general case described above and derived a rate-distortion function for 
this case resembling the Kolmogorov formula, but with @ replaced by Eq. (22b) from [2J: 
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Note that while © resembles the Kolmogorov formula, it is not the same. Both formulas 
derive from the finite dimensional versions of the Kolmogorov formula, that is, the finite order 
rate-distortion functions. But the mechanics of taking the limit from the finite order results 
differ in a critical way in that the integrand is bound away from zero, as will be described in 
more detail later. The equivalence of the two formulas follows in the stationary case because 
of the existence of source coding theorems for each, but it does not follow in the nonstationary 
case. 

Let Rk denote the Kolmogorov formula of (|4|) for stationary Gaussian processes applied to 
the autoregressive case, and let -Rar denote the autoregressive formula of ([6]) and define the 
subset E = {lo : g(u) < a 2 / '9} of [— 7r,7r]. Then 
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and hence the two formulas will not agree unless the final integral is 0. 



In 1980 Hashimoto and Arimoto [3j revisited the question of the rate-distortion in the 
nonstationary case. They considered the finite order autoregressive case and noted that both the 
source coding-theorem and the evaluation of the rate-distortion function had been accomplished 
for the Wiener process in [Tj, but they only described the source coding theorem and not the 
rate distortion function of [2] for the more general autoregressive case, stating that "the rate- 
distortion function has not been calculated for nonstationary processes except for the Wiener 
process" and presented an "example which shows the form (3) is incorrect if the process is 
not asymptotically stationary, and we present the exact form of the rate-distortion in the 
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next section." Their equation (3), however, corresponds to the Kolmogorov form of © and 
not the autoregressive form of ([6]), so that their example provided a demonstration that the 
Kolmogorov formula fails in the nonstationary case, but not that there was a problem with the 
autoregressive result © of [2]. As a result, there has been some confusion about the validity 
of the rate-distortion function of [2] in the nonstationary case and the apparently different 
result provided in [3j as well as some confusion about applicability of the specific asymptotic 
eigenvalue results for Toeplitz matrices used in [2]. 

We here reconcile the two forms for the nonstationary case and demonstrate that they are 
indeed consistent and distinct from the Kolmogorov formula in the nonstationary case. We also 
remark on some related issues regarding the eigenvalue distributions of certain asymptotically 
Toeplitz matrices. 



2 Nonstationary autoregressive processes revisited 

For the M th-order autoregressive process (a*. = for k > M) , Hashimoto and Arimoto correctly 
point out that the Kolmogorov formula (their (3)) fails for a simple first order nonstationary 
autoregressive source and they state their main result, which replaces by the formula 

M 
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where p k are the zeros of the characteristic polynomial 
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A{z) = Y,a k z- k . (9) 

k=0 

Suppose that \pi\ > | • • ■ \p m \ > 1 > |p m +i| " " ' — \Pm\- Then, this can be rewritten as 

m ^ 

R(D e ) = R K + Y J -In \a k | 2 (10) 

k=l 

where a k are the roots of A(z) outside the unit circle. 

On the other hand, the Jacobi-Jensen formula for analytic functions (e.g., [6], p. 23, or [7], 
p. 207) applied to (0) yields 



Rar = i? K + ^^hi|a fc | 2 , (11) 

k=l 

which agrees with the rate-distortion function of (|10|) . Note that since g(to) is analytic, it can 
have at most a finite number of zeros outside the unit circle. Thus, in particular, the results of 
[3] demonstrate that the Kolmogorov formula may fail for nonstationary sources, not that the 
autoregressive formula is incorrect. The two formulas agree for stationary sources and for the 
nonstationary Wiener process. 



3 Asymptotic eigenvalue distributions 

Although the rate-distortion functions of [2 J and [3 J are equivalent, they use different versions of 
the classic asymptotic eigenvalue distribution theorem for Toeplitz matrices. The classic form 
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can be described as follows. Given a discrete-time Fourier transform pair 
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let T n = {tk~e',k,l = 0, 1, ...,n — 1} be the corresponding Toeplitz matrix with eigenvalues 
T n,k] k = 0, 1, . . . , n — 1. Suppose that the essential infimum and supremum of / by mj and My, 
respectively. Then the classical theorem states that if F is a continuous function on [mf,Mf] 
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If any sequence of matrices B n is asymptotically equivalent to T n in the sense of being bounded 
and having a vanishing Hilbert-Schmidt norm \B n — T n \, then (|14p will also hold for its eigen- 
values. 

The classic Kolmogorov result for stationary autoregressive processes follows from his finite 
order results by taking T n as the nth order covariance matrix of the Gaussian process, tk-j = 
Kx(k,j), and using the Toeplitz limit to compute 
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The autoregressive result, however, instead focuses on the inverse covariance. The difference 
equation defining an autoregressive process can be written in vector form as 



A n X n = Z r 



where the lower triangular Toeplitz matrix A n is given by 
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The inverse covariance 



(4V 



a A n A n . 



(18) 

is then asymptotically equivalent to the Toeplitz matrix T n (g(uj)/a 2 ), where g(co) is given by 
([5]) and hence the Toeplitz eigenvalue distribution theorem can be applied with r ni k = 1/A n .fe, 
where the X n ^ are the eigenvalues of a~ 2 A^A n . 

As Hashimoto and Arimoto point out, in the nonstationary case direct application of the 
asymptotic eigenvalue distribution theorem does not work in evaluating the limit of (|16|) because 
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of the behavior of the X Ut k near zero. Alternatively, lnr is not continuous at r = and hence 
the conditions of the eigenvalue distribution theorem do not apply. This difficulty is obvious 
from rewriting (|16p as 

I n - 1 ( i i \ 
R(D e )= lim -Vmax 0, - In — — 

since the are not bound away from 0. The observation in [3] is that exactly the m smallest 
decrease exponentially as n increases while the remaining are bounded from zero. 
Between those m smallest \ n ,k, the £th smallest one decreases asymptotically as \pi\ n , for 
£ = 1, 2, • ■ ■ , m, and the expression (JSj) follows. 

The derivation of [2], however, avoided the above difficulty by deriving an equivalent form 
to the Kolmogorov finite order formula: 
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and then applying a variation of the Toeplitz eigenvalue theorem for functions that are confined 
to the region where the eigenvalues are bounded away from 0, that is, the Toeplitz theorem is 
applied not to the function F(X) = max(0, \ lnl/A#) as in the classic Kolmogorov case, but to 
the function F(X) = max [cr 2 /A, cr 2 /6~\ (see the discussion between (21) and (22) in [2]). This 
yields an answer with a different functional form which is not contradicted by the example of 
[3] and which has no problems with X n k near 0. 

This trick of truncating both the sum and integral to avoid problems at singularities has 
been well developed in the literature and the general result is mentioned below as an indication 
of how singularities such as arise in nonstationary autoregressive processes are easily handled 
in the asymptotic eigenvalue distribution theory. 

In some applications we wish to study the asymptotic distribution of a function F{r n k) of 
the eigenvalues of an asymptotically Toeplitz sequence of matrices that is not continuous at 
the minimum or maximum value of /. For example, in order for results derived to apply to 
the function F(f(X)) = 1//(A) which arises when treating inverses of Toeplitz matrices, it is 
often considered necessary to require that the essential infimum m/ > because the function 
F(l/x) is not continuous at x = 0. If rrif = 0, the basic asymptotic eigenvalue distribution 
breaks down and the limits and the integrals involved might not exist — the limits might exist 
and equal something else or they might simply fail to exist. 

In order to treat the inverses of Toeplitz matrices when / has zeros, define the mid function 

' z, y>z 

y, x <y < z (20) 
x, y<z 

The following result was proved in [9] and extended in [TO] . See also [TTJ [121 (HI E] ■ 

Theorem 1 Suppose that f is in the Wiener class. Then for any function F(x) continuous on 
[iP,9] C [mf,Mf] 

lim - VF(mid(^, r nik , 6)) = — F(mid(^, /(A), 0)d\. (21) 
k=0 J[J 



mid(x, y, z) = 
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This asymptotic eigenvalue distribution theorem yields the rate-distortion function in the non- 
stationary autoregressive case and avoids the singularity problems encountered by [3]. It is 
stated in [3] that the asymptotic eigenvalue distributions of T n {g) and a~ 2 A* n A n are not the 
same and this fact is demonstrated by the example of a first order nonstationary autoregressive 
Gaussian process which violates (jU), but this is only true in the strict sense that the traditional 
eigenvalue distribution theorem does not hold for the function F considered. These two matrix 
sequences, however, are asymptotically equivalent and their eigenvalue distributions do sat- 
isfy the truncated form of Theorem [TJ Thus the eigenvalues are indeed asymptotically equally 
distributed, provided they are cut off at suitable values. 

In conclusion, the results of [2] and [3] are consistent and the results of the latter provide 
no evidence of invalidity of the former. The two papers provide alternative characterizations 
of the same quantity which are related through the Jacobi-Jensen formula. The second paper 
provided the first detailed example where the Kolmogorov and autoregressive formulas for the 
rate-distortion function differed by a nonzero amount. 
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